Norwegian Speech Recognition for Telephone Applications

نویسندگان

  • Ingunn Amdal
  • Finn Tore Johansen
چکیده

In this paper we present a Norwegian tele phone speech database TABU We discuss the database design speci cation and some ex periences with recording and labelling of the database We also present some preliminary re sults with a word based recogniser trained on a subset of the database

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Norwegian numerals: a challenge to automatic speech recognition

This paper addresses the problem of speaker-independent connected numeral recognition over telephone lines. Increasing the vocabulary from digits (0-9) to numerals (0-99) opens for more user-friendly services, but it also introduces many new, language-specific problems. This paper investigates morphological, phonemic and allophonic variations in the pronunciation of numerals in Norwegian. If im...

متن کامل

Improving Performance of Telephone- Based Mandarin Speech Recognition

Since telephone is the only ubiquitous communications device in current world, it is the largest potential application field for speech techniques. Telephony speech recognition is a core technique for such telephone-based speech applications. It is well known that the bandwidth of telephone line is limited to 300~3400Hz and there are many inherent variations within the telephone network. All th...

متن کامل

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

متن کامل

Automatic Classification and Transcription of Telephone Speech in Radio Broadcast Data

Automatic transcription of telephone speech involves additional challenges compared to wideband data processing, mainly due to channel limitations and to particular characteristics of conversational telephone speech. While in TV speech recognition applications, such as automatic transcription of broadcast news, the presence of telephone data is nearly insignificant (less than 1 %), in most radi...

متن کامل

POLYCOST: A telephone-speech database for speaker recognition

This article presents an overview of the POLYCOST database dedicated to speaker recognition applications over the telephone network. The main characteristics of this database are: large mixed speech corpus size (> 100 speakers), English spoken by foreigners, mainly digits with some free speech, collected through international telephone lines, and more than eight sessions per speaker.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994